Method Designed to Respect Molecular Heterogeneity Can Profoundly Correct Present Data Interpretations for Genome-Wide Expression Analysis
نویسندگان
چکیده
Although genome-wide expression analysis has become a routine tool for gaining insight into molecular mechanisms, extraction of information remains a major challenge. It has been unclear why standard statistical methods, such as the t-test and ANOVA, often lead to low levels of reproducibility, how likely applying fold-change cutoffs to enhance reproducibility is to miss key signals, and how adversely using such methods has affected data interpretations. We broadly examined expression data to investigate the reproducibility problem and discovered that molecular heterogeneity, a biological property of genetically different samples, has been improperly handled by the statistical methods. Here we give a mathematical description of the discovery and report the development of a statistical method, named HTA, for better handling molecular heterogeneity. We broadly demonstrate the improved sensitivity and specificity of HTA over the conventional methods and show that using fold-change cutoffs has lost much information. We illustrate the especial usefulness of HTA for heterogeneous diseases, by applying it to existing data sets of schizophrenia, bipolar disorder and Parkinson's disease, and show it can abundantly and reproducibly uncover disease signatures not previously detectable. Based on 156 biological data sets, we estimate that the methodological issue has affected over 96% of expression studies and that HTA can profoundly correct 86% of the affected data interpretations. The methodological advancement can better facilitate systems understandings of biological processes, render biological inferences that are more reliable than they have hitherto been and engender translational medical applications, such as identifying diagnostic biomarkers and drug prediction, which are more robust.
منابع مشابه
Human Cancer Modeling: Recapitulating Tumor Heterogeneity Towards Personalized Medicine
Despite diagnostic, preventive and therapeutic advances, growing incidence of cancer and high rate of mortality among patients affected by specific cancer types indicate current clinical measures are not ideally useful in eradicating cancer. Chemoresistance and subsequent disease relapse are believed to be mainly driven by the cell-molecular heterogeneity of human tumors that necessitates perso...
متن کاملHuman Cancer Modeling: Recapitulating Tumor Heterogeneity Towards Personalized Medicine
Despite diagnostic, preventive and therapeutic advances, growing incidence of cancer and high rate of mortality among patients affected by specific cancer types indicate current clinical measures are not ideally useful in eradicating cancer. Chemoresistance and subsequent disease relapse are believed to be mainly driven by the cell-molecular heterogeneity of human tumors that necessitates perso...
متن کاملDetection of the “Tim” gene of sheep Giardia using “Tim” Gene primers of Giardia with human origin
Giardiasis is an important human parasitic disease. Giardia is a genus composed of binuclear flagellate protozoa. Giardia duodenalis is a parasitic species for a wide range of vertebrates, including humans. Heterogeneity in G. duodenalis has been shown by serological, biochemical, and molecular analysis. In the present study, the possible genetic similarity between Giardia in sheep and humansan...
متن کاملO-36: Genome Haplotyping and Detection of Meiotic Homologous Recombination Sites in Single Cells, A Generic Method for Preimplantation Genetic Diagnosis
Background: Haplotyping is invaluable not only to identify genetic variants underlying a disease or trait, but also to study evolution and population history as well as meiotic and mitotic recombination processes. Current genome-wide haplotyping methods rely on genomic DNA that is extracted from a large number of cells. Thus far random allele drop out and preferential amplification artifacts of...
متن کاملI-40: Male Genome Programming, Infertility and Cancer
Background: During male germ cells differentiation, genomewide re-organizations and highly specific programming of the male genome occur. These changes not only include the large-scale meiotic shuffling of genes, taking place in spermatocytes, but also a complete “re-packaging” of the male genome in post meiotic cells, leading to a highly compacted nucleo-protamine structure in the mature sperm...
متن کامل